PROTEIN SCAFFOLDS FOR ANTIBODY MIMICS
AND OTHER BINDING PROTEINS 

Cross Reference to Related Applications 
This application claims the benefit of the filing date of provisional 
application, U.S.S.N. 60/111,737, filed December 10, 1998, and utility application, 
U.S.S.N. 09/456,693, filed December 9, 1999. 

Background of the Invention 

This invention relates to protein scaffolds useful, for example, for the 
generation of products having novel binding characteristics. 

Proteins having relatively defined three-dimensional structures, 
commonly referred to as protein scaffolds, may be used as reagents for the design 
of engineered products. These scaffolds typically contain one or more regions 
which are amenable to specific or random sequence variation, and such sequence 
randomization is often carried out to produce libraries of proteins from which
desired products may be selected. One particular area in which such scaffolds are 
useful is the field of antibody design. 

A number of previous approaches to the manipulation of the mammalian 
immune system to obtain reagents or drugs have been attempted. These have 
included injecting animals with antigens of interest to obtain mixtures of 
polyclonal antibodies reactive against specific antigens, production of monoclonal 
antibodies in hybridoma cell culture (Koehler and Milstein, Nature 256:495,
1975), modification of existing monoclonal antibodies to obtain new or optimized
recognition properties, creation of novel antibody fragments with desirable binding 
characteristics, and randomization of single chain antibodies (created by 
connecting the variable regions of the heavy and light chains of antibody 
molecules with a flexible peptide linker) followed by selection for antigen binding
by phage display (Clackson et al., Nature 352:624, 1991). 

In addition, several non-immunoglobulin protein scaffolds have been 
proposed for obtaining proteins with novel binding properties. For example, a 
"minibody" scaffold, which is related to the immunoglobulin fold, has been 
designed by deleting three beta strands from a heavy chain variable domain of a
monoclonal antibody (Tramontano et al., J. Mol. Recognit. 7:9, 1994). This
protein includes 61 residues and can be used to present two hypervariable loops.
These two loops have been randomized and products selected for antigen binding,
but thus far the framework appears to have somewhat limited utility due to
solubility problems. Another framework used to display loops has been
tendamistat, a 74 residue, six-strand beta sheet sandwich held together by two
disulfide bonds (McConnell and Hoess, J. Mol. Biol. 250:460, 1995). This
scaffold includes three loops, but, to date, only two of these loops have been
examined for randomization potential.

Other proteins have been tested as frameworks and have been used to
display randomized residues on alpha helical surfaces (Nord et al., Nat.
Biotechnol. 15:772, 1997; Nord et al., Protein Eng. 8:601, 1995), loops between
alpha helices in alpha helix bundles (Ku and Schultz, Proc. Natl. Acad. Sci. USA
92:6552, 1995), and loops constrained by disulfide bridges, such as those of the
small protease inhibitors (Markland et al., Biochemistry 35:8045, 1996; Markland
et al., Biochemistry 35:8058, 1996; Rottgen and Collins, Gene 164:243, 1995;
Wang et al., J. Biol. Chem. 270:12250, 1995).



Summary of the Invention 
The present invention provides a new family of proteins capable of 
evolving to bind any compound of interest. These proteins, which make use of a
fibronectin or fibronectin-like scaffold, function in a manner characteristic of
natural or engineered antibodies (that is, polyclonal, monoclonal, or single-chain
antibodies) and, in addition, possess structural advantages. Specifically, the
structure of these antibody mimics has been designed for optimal folding, stability,
and solubility, even under conditions which normally lead to the loss of structure
and function in antibodies.

These antibody mimics may be utilized for the purpose of designing
proteins which are capable of binding to virtually any compound (for example, any
protein) of interest. In particular, the fibronectin-based molecules described herein
may be used as scaffolds which are subjected to directed evolution designed to
randomize one or more of the three fibronectin loops which are analogous to the
complementarity-determining regions (CDRs) of an antibody variable region.
Such a directed evolution approach results in the production of antibody-like
molecules with high affinities for antigens of interest. In addition, the scaffolds
described herein may be used to display defined exposed loops (for example, loops
previously randomized and selected on the basis of antigen binding) in order to
direct the evolution of molecules that bind to such introduced loops. A selection
of this type may be carried out to identify recognition molecules for any individual
CDR-like loop or, alternatively, for the recognition of two or all three CDR-like
loops combined into a non-linear epitope.

Accordingly, the present invention features a protein that includes a 
fibronectin type III domain having at least one randomized loop, the protein being 
characterized by its ability to bind to a compound that is not bound by the 
corresponding naturally-occurring fibronectin. 

In preferred embodiments, the fibronectin type III domain is a 
mammalian (for example, a human) fibronectin type III domain; and the protein 
includes the tenth module of the fibronectin type III (¹⁰Fn3) domain. In such
proteins, compound binding is preferably mediated by either one, two, or three
¹⁰Fn3 loops. In other preferred embodiments, the second loop of ¹⁰Fn3 may be
extended in length relative to the naturally-occurring module, or the ¹⁰Fn3 may
lack an integrin-binding motif. In these molecules, the integrin-binding motif may
be replaced by an amino acid sequence in which a basic amino acid-neutral amino
acid-acidic amino acid sequence (in the N-terminal to C-terminal direction)
replaces the integrin-binding motif; one preferred sequence is serine-glycine- 
glutamate. In another preferred embodiment, the fibronectin type III domain- 
containing proteins of the invention lack disulfide bonds. 

Any of the fibronectin type III domain-containing proteins described 
herein may be formulated as part of a fusion protein (for example, a fusion protein
which further includes an immunoglobulin domain, a complement protein, a
toxin protein, or an albumin protein). In addition, any of the fibronectin type III
domain proteins may be covalently bound to a nucleic acid (for example, an 
RNA), and the nucleic acid may encode the protein. Moreover, the protein may be 
a multimer, or, particularly if it lacks an integrin-binding motif, it may be 
formulated in a physiologically-acceptable carrier. 



The present invention also features proteins that include a fibronectin 
type III domain having at least one mutation in a β-sheet sequence which changes
the scaffold structure. Again, these proteins are characterized by their ability to 
bind to compounds that are not bound by the corresponding naturally-occurring 
fibronectin. 

In addition, any of the fibronectin scaffolds of the invention may be 
immobilized on a solid support (for example, a bead or chip), and these scaffolds 
may be arranged in any configuration on the solid support, including an array.

In a related aspect, the invention further features nucleic acids encoding
any of the proteins of the invention. In preferred embodiments, the nucleic acid is 
DNA or RNA. 

In another related aspect, the invention also features a method for 
generating a protein which includes a fibronectin type III domain and which is 
pharmaceutically acceptable to a mammal, involving removing the integrin- 
binding domain of said fibronectin type III domain. This method may be applied 
to any of the fibronectin type III domain-containing proteins described above and 
is particularly useful for generating proteins for human therapeutic applications.
The invention also features such fibronectin type III domain-containing proteins
which lack integrin-binding domains. 

In yet other related aspects, the invention features screening methods 
which may be used to obtain or evolve randomized fibronectin type III proteins 
capable of binding to compounds of interest, or to obtain or evolve compounds (for 
example, proteins) capable of binding to a particular protein containing a 
randomized fibronectin type III motif. In addition, the invention features
screening procedures which combine these two methods, in any order, to obtain
either compounds or proteins of interest.

In particular, the first screening method, useful for the isolation or 
identification of randomized proteins of interest, involves: (a) contacting the
compound with a candidate protein, the candidate protein including a fibronectin
type III domain having at least one randomized loop, the contacting being carried
out under conditions that allow compound-protein complex formation; and (b) 
obtaining, from the complex, the protein which binds to the compound. 

The second screening method, for isolating or identifying a compound 
which binds to a protein having a randomized fibronectin type III domain,
involves: (a) contacting the protein with a candidate compound, the contacting
being carried out under conditions that allow compound-protein complex
formation; and (b) obtaining, from the complex, the compound which binds to the
protein. 

In preferred embodiments, the methods further involve either 
randomizing at least one loop of the fibronectin type III domain of the protein
obtained in step (b) and repeating steps (a) and (b) using the further randomized
protein, or modifying the compound obtained in step (b) and repeating steps (a)
and (b) using the further modified compound. In addition, the compound is
preferably a protein, and the fibronectin type III domain is preferably a
mammalian (for example, a human) fibronectin type III domain. In other preferred
embodiments, the protein includes the tenth module of the fibronectin type III
domain (¹⁰Fn3), and binding is mediated by one, two, or three ¹⁰Fn3 loops. In
addition, the second loop of ¹⁰Fn3 may be extended in length relative to the
naturally-occurring module, or ¹⁰Fn3 may lack an integrin-binding motif. Again,
as described above, the integrin-binding motif may be replaced by an amino acid
sequence in which a basic amino acid-neutral amino acid-acidic amino acid
sequence (in the N-terminal to C-terminal direction) replaces the integrin-binding 
motif; one preferred sequence is serine-glycine-glutamate. 

The selection methods described herein may be carried out using any
fibronectin type III domain-containing protein. For example, the fibronectin type
III domain-containing protein may lack disulfide bonds, or may be formulated as
part of a fusion protein (for example, a fusion protein which further includes an
immunoglobulin Fc domain, a complement protein, a toxin protein, or an albumin
protein). In addition, selections may be carried out using the fibronectin type III
domain proteins covalently bound to nucleic acids (for example, RNAs or any
nucleic acid which encodes the protein). Moreover, the selections may be carried
out using fibronectin domain-containing protein multimers.

Preferably, the selections involve the immobilization of the binding 
target on a solid support. Preferred solid supports include columns (for example,
affinity columns, such as agarose columns) or microchips.

In addition, the invention features diagnostic methods which employ the 
fibronectin scaffold proteins of the invention. Such diagnostic methods may be 
carried out on a sample (for example, a biological sample) to detect one analyte or 
to simultaneously detect many different analytes in the sample. The method may
employ any of the scaffold molecules described herein. Preferably, the method
involves (a) contacting the sample with a protein which binds to the compound
analyte and which includes a fibronectin type III domain having at least one
randomized loop, the contacting being carried out under conditions that allow
compound-protein complex formation; and (b) detecting the complex, and
therefore the compound in the sample.

In preferred embodiments, the protein is immobilized on a solid support
(for example, a chip or bead) and may be immobilized as part of an array. The
protein may be covalently bound to a nucleic acid, preferably, a nucleic acid, such
as RNA, that encodes the protein. In addition, the compound is often a protein, but
may also be any other analyte in a sample. Detection may be accomplished by any
standard technique including, without limitation, radiography, fluorescence 
detection, mass spectroscopy, or surface plasmon resonance. 

As used herein, by "fibronectin type III domain" is meant a domain 
having 7 or 8 beta strands which are distributed between two beta sheets, which 
themselves pack against each other to form the core of the protein, and further 
containing loops which connect the beta strands to each other and are solvent
exposed. There are at least three such loops at each edge of the beta sheet 
sandwich, where the edge is the boundary of the protein perpendicular to the 
direction of the beta strands. Preferably, a fibronectin type III domain includes a 
sequence which exhibits at least 30% amino acid identity, and preferably at least 
50% amino acid identity, to the sequence encoding the structure of the ¹⁰Fn3
domain referred to as "1ttg" (ID = "1ttg" (one ttg)) available from the Protein Data
Bank. Sequence identity referred to in this definition is determined by the
Homology program, available from Molecular Simulation (San Diego, CA). The
invention further includes polymers of ¹⁰Fn3-related molecules, which are an
extension of the use of the monomer structure, whether or not the subunits of the 
polyprotein are identical or different in sequence. 
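
As a rough illustration of the 30% and 50% identity thresholds in the definition above, the following Python sketch computes percent identity over two sequences that have already been aligned. It is not the Homology program named in the text. The function names, the gap character, and the choice to count only ungapped columns are assumptions made for this example, and other identity conventions would give different numbers.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical residues over the ungapped columns of a pairwise alignment.

    Assumption: both inputs are pre-aligned strings of equal length, with
    `gap` marking alignment gaps. Columns containing a gap are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # ignore gapped columns
        compared += 1
        if res_a == res_b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 30.0) -> bool:
    """True if the pair reaches the given percent identity (30 or 50 in the text)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

A candidate domain would be compared against the reference sequence with `threshold=30.0` for the basic definition and `threshold=50.0` for the preferred one.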

By "naturally occurring fibronectin" is meant any fibronectin protein
that is encoded by a living organism.

By "randomized" is meant including one or more amino acid alterations
relative to a template sequence.

By a "protein" is meant any sequence of two or more amino acids,
regardless of length, post-translation modification, or function. "Protein" and
"peptide" are used interchangeably herein.

By "RNA" is meant a sequence of two or more covalently bonded, 
naturally occurring or modified ribonucleotides. One example of a modified RNA 
included within this term is phosphorothioate RNA. 

By "DNA" is meant a sequence of two or more covalently bonded, 
naturally occurring or modified deoxyribonucleotides.

By a "nucleic acid" is meant any two or more covalently bonded
nucleotides or nucleotide analogs or derivatives. As used herein, this term
includes, without limitation, DNA, RNA, and PNA.

By "pharmaceutically acceptable" is meant a compound or protein that 
may be administered to an animal (for example, a mammal) without significant
adverse medical consequences.

By "physiologically acceptable carrier" is meant a carrier which does 
not have a significant detrimental impact on the treated host and which retains the 
therapeutic properties of the compound with which it is administered. One 
exemplary physiologically acceptable carrier is physiological saline. Other
physiologically acceptable carriers and their formulations are known to one skilled
in the art and are described, for example, in Remington's Pharmaceutical Sciences,
(18th edition), ed. A. Gennaro, 1990, Mack Publishing Company, Easton, PA,
incorporated herein by reference.

By "selecting" is meant substantially partitioning a molecule from other
molecules in a population. As used herein, a "selecting" step provides at least a
2-fold, preferably, a 30-fold, more preferably, a 100-fold, and, most preferably, a
1000-fold enrichment of a desired molecule relative to undesired molecules in a
population following the selection step. A selection step may be repeated any
number of times, and different types of selection steps may be combined in a given
approach. 
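
The enrichment thresholds in this definition can be made concrete with a small calculation. The Python sketch below is only an illustration: it assumes that "enrichment" means the ratio of desired to undesired molecules after a step divided by the same ratio before the step, and the function names and example counts are hypothetical.

```python
def fold_enrichment(desired_before: float, undesired_before: float,
                    desired_after: float, undesired_after: float) -> float:
    """Desired:undesired ratio after a step, divided by the same ratio before it."""
    ratio_before = desired_before / undesired_before
    ratio_after = desired_after / undesired_after
    return ratio_after / ratio_before


def enrichment_tier(fold: float) -> str:
    """Map a fold enrichment onto the 2, 30, 100 and 1000-fold levels of the definition."""
    if fold >= 1000:
        return "most preferred"
    if fold >= 100:
        return "more preferred"
    if fold >= 30:
        return "preferred"
    if fold >= 2:
        return "selecting"
    return "not a selecting step"


# Hypothetical counts: 1 desired per 10,000 undesired before the step,
# 50 desired per 1,000 undesired after it.
fold = fold_enrichment(1, 10_000, 50, 1_000)
tier = enrichment_tier(fold)
```

With these example counts the step gives roughly a 500-fold enrichment, which falls in the "more preferred" band.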

By "binding partner," as used herein, is meant any molecule which has a 
specific, covalent or non-covalent affinity for a portion of a desired compound (for 
example, protein) of interest. Examples of binding partners include, without 
limitation, members of antigen/antibody pairs, protein/inhibitor pairs,
receptor/ligand pairs (for example, cell surface receptor/ligand pairs, such as
hormone receptor/peptide hormone pairs), enzyme/substrate pairs (for example,
kinase/substrate pairs), lectin/carbohydrate pairs, oligomeric or heterooligomeric
protein aggregates, DNA binding protein/DNA binding site pairs, RNA/protein
pairs, and nucleic acid duplexes, heteroduplexes, or ligated strands, as well as any
molecule which is capable of forming one or more covalent or non-covalent bonds
(for example, disulfide bonds) with any portion of another molecule (for example,
a compound or protein).

By a "solid support" is meant, without limitation, any column (or
column material), bead, test tube, microtiter dish, solid particle (for example,
agarose or sepharose), microchip (for example, silicon, silicon-glass, or gold chip),
or membrane (for example, the membrane of a liposome or vesicle) to which a
fibronectin scaffold or an affinity complex may be bound, either directly or
indirectly (for example, through other binding partner intermediates such as other
antibodies or Protein A), or in which a fibronectin scaffold or an affinity complex
may be embedded (for example, through a receptor or channel).

The present invention provides a number of advantages. For example, 
as described in more detail below, the present antibody mimics exhibit improved 
biophysical properties, such as stability under reducing conditions and solubility at 
high concentrations. In addition, these molecules may be readily expressed and 
folded in prokaryotic systems, such as E. coli, in eukaryotic systems, such as yeast,
and in in vitro translation systems, such as the rabbit reticulocyte lysate system.
Moreover, these molecules are extremely amenable to affinity maturation
techniques involving multiple cycles of selection, including in vitro selection using
RNA-protein fusion technology (Roberts and Szostak, Proc. Natl. Acad. Sci. USA
94:12297, 1997; Szostak et al., U.S.S.N. 09/007,005 and U.S.S.N. 09/247,190;
Szostak et al., WO98/31700), phage display (see, for example, Smith and Petrenko,
Chem. Rev. 97:317, 1997), and yeast display systems (see, for example, Boder and 
Wittrup, Nature Biotech. 15:553, 1997). 

Other features and advantages of the present invention will be apparent 
from the following detailed description thereof, and from the claims. 

Brief Description of the Drawings 
FIGURE 1 is a photograph showing a comparison between the
structures of antibody heavy chain variable regions from camel (dark blue) and
llama (light blue), in each of two orientations.

FIGURE 2 is a photograph showing a comparison between the
structures of the camel antibody heavy chain variable region (dark blue), the llama
antibody heavy chain variable region (light blue), and a fibronectin type III module
number 10 (¹⁰Fn3) (yellow).

FIGURE 3 is a photograph showing a fibronectin type III module 
number 10 (¹⁰Fn3), with the loops corresponding to the antigen-binding loops in
IgG heavy chains highlighted in red.

FIGURE 4 is a graph illustrating a sequence alignment between a
fibronectin type III protein domain and related protein domains.

FIGURE 5 is a photograph showing the structural similarities between a
¹⁰Fn3 domain and 15 related proteins, including fibronectins, tenascins, collagens,
and undulin. In this photograph, the regions are labeled as follows: constant, dark
blue; conserved, light blue; neutral, white; variable, red; and RGD integrin-binding
motif (variable), yellow.

FIGURE 6 is a photograph showing space filling models of fibronectin 
III modules 9 and 10, in each of two different orientations. The two modules and 
the integrin binding loop (RGD) are labeled. In this figure, blue indicates
positively charged residues, red indicates negatively charged residues, and white
indicates uncharged residues.

FIGURE 7 is a photograph showing space filling models of fibronectin
III modules 7-10, in each of three different orientations. The four modules are
labeled. In this figure, blue indicates positively charged residues, red indicates
negatively charged residues, and white indicates uncharged residues.

FIGURE 8 is a photograph illustrating the formation, under different salt
conditions, of RNA-protein fusions which include fibronectin type III domains.

FIGURE 9 is a series of photographs illustrating the selection of
fibronectin type III domain-containing RNA-protein fusions, as measured by PCR
signal analysis.

FIGURE 10 is a graph illustrating an increase in the percent TNF-α
binding during the selections described herein, as well as a comparison between
RNA-protein fusion and free protein selections.

FIGURE 11 is a series of schematic representations showing IgG, ¹⁰Fn3,
Fn-CH1-CH2-CH3, and Fn-CH2-CH3 (clockwise from top left).

FIGURE 12 is a photograph showing a molecular model of Fn-CH1-CH2-CH3
based on known three-dimensional structures of IgG (X-ray
crystallography) and ¹⁰Fn3 (NMR and X-ray crystallography).

FIGURE 13 is a graph showing the time course of an exemplary
¹⁰Fn3-based nucleic acid-protein fusion selection of TNF-α binders. The
proportion of nucleic acid-protein fusion pool (open diamonds) and free protein
pool (open circles) that bound to TNF-α-Sepharose, and the proportion of free
protein pool (full circles) that bound to underivatized Sepharose, are shown.

FIGURES 14 and 15 are graphs illustrating TNF-α binding by TNF-α
Fn-binders. In particular, these figures show mass spectra data obtained from a
¹⁰Fn3 fusion chip and non-fusion chip, respectively.

FIGURES 16 and 17 are the phosphorimage and fluorescence scan,
respectively, of a ¹⁰Fn3 array, illustrating TNF-α binding.

Detailed Description 

The novel antibody mimics described herein have been designed to be
superior both to antibody-derived fragments and to non-antibody frameworks, for
example, those frameworks described above.

The major advantage of these antibody mimics over antibody fragments 
is structural. These scaffolds are derived from whole, stable, and soluble structural
modules found in human body fluid proteins. Consequently, they exhibit better
folding and thermostability properties than antibody fragments, whose creation
involves the removal of parts of the antibody native fold, often exposing amino
acid residues that, in an intact antibody, would be buried in a hydrophobic
environment, such as an interface between variable and constant domains.
Exposure of such hydrophobic residues to solvent increases the likelihood of 
aggregation. 

In addition, the antibody mimics described herein have no disulfide 
bonds, which have been reported to retard or prevent proper folding of antibody 
fragments under certain conditions. Since the present scaffolds do not rely on
disulfides for native fold stability, they are stable under reducing conditions, unlike
antibodies and their fragments which unravel upon disulfide bond breakdown.

Moreover, these fibronectin-based scaffolds provide the functional
advantages of antibody molecules. In particular, despite the fact that the Fn3
module is not an immunoglobulin, its overall fold is close to that of the variable
region of the IgG heavy chain (Figure 2), making it possible to display the three
fibronectin loops analogous to CDRs in relative orientations similar to those of
native antibodies. Because of this structure, the present antibody mimics possess
antigen binding properties that are similar in nature and affinity to those of
antibodies, and a loop randomization and shuffling strategy may be employed in
vitro that is similar to the process of affinity maturation of antibodies in vivo.

There are now described below exemplary fibronectin-based scaffolds 
and their use for identifying, selecting, and evolving novel binding proteins as well 
as their target ligands. These examples are provided for the purpose of illustrating, 
and not limiting, the invention.

¹⁰Fn3 Structural Motif

The antibody mimics of the present invention are based on the structure 
of a fibronectin module of type III (Fn3), a common domain found in mammalian 
blood and structural proteins. This domain occurs more than 400 times in the 
protein sequence database and has been estimated to occur in 2% of the proteins 
sequenced to date, including fibronectins, tenascin, intracellular cytoskeletal
proteins, and prokaryotic enzymes (Bork and Doolittle, Proc. Natl. Acad. Sci. USA
89:8990, 1992; Bork et al., Nature Biotech. 15:553, 1997; Meinke et al., J.
Bacteriol. 175:1910, 1993; Watanabe et al., J. Biol. Chem. 265:15659, 1990). In
particular, these scaffolds include, as templates, the tenth module of human Fn3
(¹⁰Fn3), which comprises 94 amino acid residues. The overall fold of this domain
is closely related to that of the smallest functional antibody fragment, the variable
region of the heavy chain, which comprises the entire antigen recognition unit in
camel and llama IgG (Figures 1 and 2). The major differences between camel and
llama domains and the ¹⁰Fn3 domain are that (i) ¹⁰Fn3 has fewer beta strands
(seven vs. nine) and (ii) the two beta sheets packed against each other are
connected by a disulfide bridge in the camel and llama domains, but not in ¹⁰Fn3.

The three loops of ¹⁰Fn3 corresponding to the antigen-binding loops of
the IgG heavy chain run between amino acid residues 21-31, 51-56, and 76-88
(Figure 3). The lengths of the first and the third loops, 11 and 12 residues,
respectively, fall within the range of the corresponding antigen-recognition loops
found in antibody heavy chains, that is, 10-12 and 3-25 residues, respectively.
Accordingly, once randomized and selected for high antigen affinity, these two 
loops make contacts with antigens equivalent to the contacts of the corresponding 
loops in antibodies. 

In contrast, the second loop of ¹⁰Fn3 is only 6 residues long, whereas the
corresponding loop in antibody heavy chains ranges from 16-19 residues. To
optimize antigen binding, therefore, the second loop of ¹⁰Fn3 is preferably
extended by 10-13 residues (in addition to being randomized) to obtain the greatest
possible flexibility and affinity in antigen binding. Indeed, in general, the lengths
as well as the sequences of the CDR-like loops of the antibody mimics may be 
randomized during in vitro or in vivo affinity maturation (as described in more 
detail below). 
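
The loop randomization and second-loop extension described above can be sketched in a few lines of Python. This is a minimal illustration, not the library construction method of the invention: the residue ranges 21-31, 51-56, and 76-88 are taken from the text and treated as 1-based inclusive positions (an assumption), the template is a 94-residue placeholder rather than the real ¹⁰Fn3 sequence, and the function and parameter names are hypothetical.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids

# CDR-like loop ranges quoted in the text, assumed 1-based and inclusive.
CDR_LIKE_LOOPS = [(21, 31), (51, 56), (76, 88)]


def randomize_loops(template, loops=CDR_LIKE_LOOPS, extend_second_by=0, rng=None):
    """Return a variant with every loop replaced by random residues.

    Framework residues outside the loops are copied unchanged. The second
    loop may be lengthened by `extend_second_by` residues (10-13 in the text).
    """
    rng = rng or random.Random()
    parts = []
    next_pos = 1  # 1-based position of the next framework residue to copy
    for index, (start, end) in enumerate(loops):
        parts.append(template[next_pos - 1:start - 1])  # framework before the loop
        loop_length = end - start + 1
        if index == 1:
            loop_length += extend_second_by
        parts.append("".join(rng.choice(AMINO_ACIDS) for _ in range(loop_length)))
        next_pos = end + 1
    parts.append(template[next_pos - 1:])  # framework after the last loop
    return "".join(parts)


# Placeholder 94-residue template; a real run would use the 10Fn3 sequence.
template = "G" * 94
variant = randomize_loops(template, extend_second_by=10, rng=random.Random(0))
```

Passing a seeded `random.Random` makes a run reproducible. Extending the second loop by 10 residues lengthens the 94-residue placeholder to 104 residues while leaving the framework positions untouched.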

The tenth human fibronectin type III domain, ¹⁰Fn3, refolds rapidly
even at low temperature; its backbone conformation has been recovered within 1
second at 5°C. Thermodynamic stability of ¹⁰Fn3 is high (ΔG_U = 24 kJ/mol = 5.7
kcal/mol), correlating with its high melting temperature of 110°C.
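
The two stability figures quoted above are the same quantity in different units. The short Python check below assumes the thermochemical calorie (1 kcal = 4.184 kJ); the variable names are illustrative only.

```python
KJ_PER_KCAL = 4.184  # thermochemical calorie, assumed conversion factor

delta_g_kj_per_mol = 24.0  # unfolding free energy quoted in the text
delta_g_kcal_per_mol = delta_g_kj_per_mol / KJ_PER_KCAL

print(round(delta_g_kcal_per_mol, 1))  # 5.7
```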

One of the physiological roles of ¹⁰Fn3 is as a subunit of fibronectin, a
glycoprotein that exists in a soluble form in body fluids and in an insoluble form in
the extracellular matrix (Dickinson et al., J. Mol. Biol. 236:1079, 1994). A
fibronectin monomer of 220-250 kD contains 12 type I modules, two type II
modules, and 17 fibronectin type III modules (Potts and Campbell, Curr. Opin. Cell
Biol. 6:648, 1994). Different type III modules are involved in the binding of
fibronectin to integrins, heparin, and chondroitin sulfate. ¹⁰Fn3 was found to
mediate cell adhesion through an integrin-binding Arg-Gly-Asp (RGD) motif on
one of its exposed loops. Similar RGD motifs have been shown to be involved in
integrin binding by other proteins, such as fibrinogen, von Willebrand factor, and
vitronectin (Hynes et al., Cell 69:11, 1992). No other matrix- or cell-binding roles
have been described for ¹⁰Fn3.

The observation that ¹⁰Fn3 has only slightly more adhesive activity than
a short peptide containing RGD is consistent with the conclusion that the
cell-binding activity of ¹⁰Fn3 is localized in the RGD peptide rather than
distributed throughout the ¹⁰Fn3 structure (Baron et al., Biochemistry 31:2068,
1992). The fact that ¹⁰Fn3 without the RGD motif is unlikely to bind to other
plasma proteins or extracellular matrix makes ¹⁰Fn3 a useful scaffold to replace
antibodies. In addition, the presence of ¹⁰Fn3 in natural fibrinogen in the
bloodstream suggests that ¹⁰Fn3 itself is unlikely to be immunogenic in the
organism of origin.

In addition, we have determined that the ¹⁰Fn3 framework possesses
exposed loop sequences tolerant of randomization, facilitating the generation of
diverse pools of antibody mimics. This determination was made by examining the
flexibility of the ¹⁰Fn3 sequence. In particular, the human ¹⁰Fn3 sequence was
aligned with the sequences of fibronectins from other sources as well as sequences
of related proteins (Figure 4), and the results of this alignment were mapped onto
the three-dimensional structure of the human ¹⁰Fn3 domain (Figure 5). This
alignment revealed that the majority of conserved residues are found in the core of
the beta sheet sandwich, whereas the highly variable residues are located along the
edges of the beta sheets, including the N- and C-termini, on the solvent-accessible
faces of both beta sheets, and on three solvent-accessible loops that serve as the
hypervariable loops for affinity maturation of the antibody mimics. In view of
these results, the randomization of these three loops is unlikely to have an
adverse effect on the overall fold or stability of the ¹⁰Fn3 framework itself.

For the human ¹⁰Fn3 sequence, this analysis indicates that, at a
minimum, amino acids 1-9, 44-50, 61-54, 82-94 (edges of beta sheets); 19, 21,
30-46 (even), 79-65 (odd) (solvent-accessible faces of both beta sheets); 21-31,
51-56, 76-88 (CDR-like solvent-accessible loops); and 14-16 and 36-45 (other
solvent-accessible loops and beta turns) may be randomized to evolve new or 
improved compound-binding proteins. In addition, as discussed above, alterations 
in the lengths of one or more solvent exposed loops may also be included in such
directed evolution methods. Alternatively, changes in the β-sheet sequences may
also be used to evolve new proteins. These mutations change the scaffold and
thereby indirectly alter loop structure(s). If this approach is taken, mutations
should not saturate the sequence, but rather few mutations should be introduced.
Preferably, no more than 10 amino acid changes, and, more preferably, no more
than 3 amino acid changes should be introduced to the β-sheet sequences by this
approach. 

Fibronectin Fusions 

The antibody mimics described herein may be fused to other protein
domains. For example, these mimics may be integrated with the human immune
response by fusing the constant region of an IgG (Fc) with a ¹⁰Fn3 module,
preferably through the C-terminus of ¹⁰Fn3. The Fc in such a ¹⁰Fn3-Fc fusion
molecule activates the complement component of the immune response and
increases the therapeutic value of the antibody mimic. Similarly, a fusion between
¹⁰Fn3 and a complement protein, such as C1q, may be used to target cells, and a
fusion between ¹⁰Fn3 and a toxin may be used to specifically destroy cells that
carry a particular antigen. In addition, ¹⁰Fn3 in any form may be fused with
albumin to increase its half-life in the bloodstream and its tissue penetration. Any
of these fusions may be generated by standard techniques, for example, by
expression of the fusion protein from a recombinant fusion gene constructed using
publicly available gene sequences.



Fibronectin Scaffold Multimers 

In addition to fibronectin monomers, any of the fibronectin constructs
described herein may be generated as dimers or multimers of ¹⁰Fn3-based antibody
mimics as a means to increase the valency and thus the avidity of antigen binding.
Such multimers may be generated through covalent binding between individual
¹⁰Fn3 modules, for example, by imitating the natural ⁸Fn3-⁹Fn3-¹⁰Fn3
C-to-N-terminus binding or by imitating antibody dimers that are held together
through their constant regions. A ¹⁰Fn3-Fc construct may be exploited to design
dimers of the general scheme of ¹⁰Fn3-Fc::Fc-¹⁰Fn3. The bonds engineered into
the Fc::Fc interface may be covalent or non-covalent. In addition, dimerizing or
multimerizing partners other than Fc can be used in ¹⁰Fn3 hybrids to create such
higher order structures.

In particular examples, covalently bonded multimers may be generated
by constructing fusion genes that encode the multimer or, alternatively, by
engineering codons for cysteine residues into monomer sequences and allowing
disulfide bond formation to occur between the expression products.
Non-covalently bonded multimers may also be generated by a variety of techniques.
These include the introduction, into monomer sequences, of codons corresponding
to positively and/or negatively charged residues and allowing interactions between
these residues in the expression products (and therefore between the monomers) to
occur. This approach may be simplified by taking advantage of charged residues
naturally present in a monomer subunit, for example, the negatively charged
residues of fibronectin. Another means for generating non-covalently bonded
antibody mimics is to introduce, into the monomer gene (for example, at the
amino- or carboxy-termini), the coding sequences for proteins or protein domains
known to interact. Such proteins or protein domains include coiled-coil motifs,
leucine zipper motifs, and any of the numerous protein subunits (or fragments
thereof) known to direct formation of dimers or higher order multimers.

Fibronectin-Like Molecules 

Although ¹⁰Fn3 represents a preferred scaffold for the generation of
antibody mimics, other molecules may be substituted for ¹⁰Fn3 in the molecules
described herein. These include, without limitation, human fibronectin modules
¹Fn3-⁹Fn3 and ¹¹Fn3-¹⁷Fn3 as well as related Fn3 modules from non-human
animals and prokaryotes. In addition, Fn3 modules from other proteins with
sequence homology to ¹⁰Fn3, such as tenascins and undulins, may also be used.
Modules from different organisms and parent proteins may be most appropriate for
different applications; for example, in designing an antibody mimic, it may be
most desirable to generate that protein from a fibronectin or fibronectin-like
molecule native to the organism for which a therapeutic or diagnostic molecule is
intended.



Directed Evolution of Scaffold-Based Binding Proteins 

The antibody mimics described herein may be used in any technique for
evolving new or improved binding proteins. In one particular example, the target
of binding is immobilized on a solid support, such as a column resin or microtiter
plate well, and the target is contacted with a library of candidate scaffold-based
binding proteins. Such a library may consist of ¹⁰Fn3 clones constructed from the
wild type ¹⁰Fn3 scaffold through randomization of the sequence and/or the length
of the ¹⁰Fn3 CDR-like loops. If desired, this library may be an RNA-protein fusion
library generated, for example, by the techniques described in Szostak et al.,
U.S.S.N. 09/007,005 and 09/247,190; Szostak et al., WO98/31700; and Roberts &
Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302.
Alternatively, it may be a DNA-protein library (for example, as described in
Lohse, DNA-Protein Fusions and Uses Thereof, U.S.S.N. 60/110,549, U.S.S.N.
09/459,190, and US 99/28472). The fusion library is incubated with the
immobilized target, the support is washed to remove non-specific binders, and the
tightest binders are eluted under very stringent conditions and subjected to PCR to
recover the sequence information or to create a new library of binders which may
be used to repeat the selection process, with or without further mutagenesis of the
sequence. A number of rounds of selection may be performed until binders of
sufficient affinity for the antigen are obtained.

In one particular example, the ¹⁰Fn3 scaffold may be used as the
selection target. For example, if a protein is required that binds a specific peptide
sequence presented in a ten residue loop, a single ¹⁰Fn3 clone is constructed in
which one of its loops has been set to the length of ten and to the desired sequence.
The new clone is expressed in vivo and purified, and then immobilized on a solid
support. An RNA-protein fusion library based on an appropriate scaffold is then
allowed to interact with the support, which is then washed, and desired molecules
are eluted and re-selected as described above.

Similarly, the ¹⁰Fn3 scaffold may be used to find natural proteins that
interact with the peptide sequence displayed in a ¹⁰Fn3 loop. The ¹⁰Fn3 protein is
immobilized as described above, and an RNA-protein fusion library is screened for
binders to the displayed loop. The binders are enriched through multiple rounds of
selection and identified by DNA sequencing.

In addition, in the above approaches, although RNA-protein libraries
represent exemplary libraries for directed evolution, any type of scaffold-based
library may be used in the selection methods of the invention.

Use 

The antibody mimics described herein may be evolved to bind any 
antigen of interest. These proteins have thermodynamic properties superior to 
those of natural antibodies and can be evolved rapidly in vitro. Accordingly, these
antibody mimics may be employed in place of antibodies in all areas in which 
antibodies are used, including in the research, therapeutic, and diagnostic fields. In 
addition, because these scaffolds possess solubility and stability properties superior 
to antibodies, the antibody mimics described herein may also be used under 
conditions which would destroy or inactivate antibody molecules. Finally, because 
the scaffolds of the present invention may be evolved to bind virtually any 
compound, these molecules provide completely novel binding proteins which also 
find use in the research, diagnostic, and therapeutic areas. 

Experimental Results 

Exemplary scaffold molecules described above were generated and
tested, for example, in selection protocols, as follows.

Library construction 

A complex library was constructed from three fragments, each of which
contained one randomized area corresponding to a CDR-like loop. The fragments
were named BC, DE, and FG, based on the names of the CDR-H-like loops
contained within them; in addition to ¹⁰Fn3 and a randomized sequence, each of
the fragments contained stretches encoding an N-terminal His6 domain or a
C-terminal FLAG peptide tag. At each junction between two fragments (i.e.,
between the BC and DE fragments or between the DE and FG fragments), each
DNA fragment contained recognition sequences for the Ear I Type IIS restriction
endonuclease. This restriction enzyme allowed the splicing together of adjacent
fragments while removing all foreign, non-¹⁰Fn3, sequences. It also allows for a
recombination-like mixing of the three ¹⁰Fn3 fragments between cycles of
mutagenesis and selection.

Each fragment was assembled from two overlapping oligonucleotides,
which were first annealed, then extended to form the double-stranded DNA form
of the fragment. The oligonucleotides that were used to construct and process the
three fragments are listed below; the "Top" and "Bottom" species for each
fragment are the oligonucleotides that contained the entire ¹⁰Fn3 encoding
sequence. In these oligonucleotide designations, "N" indicates A, T, C, or G; and
"S" indicates C or G.
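
The (SNN)n repeats in the "Bot" oligonucleotides listed below are written on the
bottom strand; read on the coding strand, they correspond to NNS codons. The
following is a minimal Python sketch of that relationship. The function name and
the complement table are illustrative assumptions and are not part of the
described method.

```python
# Complements for the bases and degenerate codes defined above:
# N (A, T, C, or G) pairs with N; S (C or G) pairs with S.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C", "N": "N", "S": "S"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a (possibly degenerate) DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


if __name__ == "__main__":
    # A bottom-strand SNN triplet reads as an NNS codon on the coding strand.
    print(reverse_complement("SNN"))      # NNS
    print(reverse_complement("SNN" * 4))  # NNSNNSNNSNNS
```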

HFnLBCTop (His):

5'- GG AAT TCC TAA TAG GAG TCA CTA TAG GGA CAA TTA CTA TTT
ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC GTT TCT GAT GTT 
CCG AGG GAG CTG GAA GTT GTT GCT GCG ACC CCC ACC AGC-3' (SEQ 
ID NO: 1)

HFnLBCTop (an alternative N-terminus):

5'- GG AAT TCC TAA TAG GAG TGA CTA TAG GGA CAA TTA CTA TTT 
ACA ATT AGA ATG GTT TCT GAT GTT CGG AGG GAG CTG GAA GTT 
GTT GCT GCG ACC CCC ACC AGC-3' (SEQ ID NO: 2) 

HFnLBCBot-flag8:

5'- AGG GGA TGC GTT GTC GTC GTC GTC GTT GTA GTC GCT GTT CCC 
TGT TTC TCC GTA AGT GAT CCT GTA ATA TCT (SNN)7 CCA GCT GAT 
GAG TAG GCT GGT GGG GGT CGG AGG -3' (SEQ ID NO: 3) 

HFnBC3'-flag8: 

5'-AGC GGA TGC GTT GTC GTC GTC GTC GTT GTA GTC GCT GTT CCC 
TGT TTC TCC GTA AGT GAT CC-3' (SEQ ID NO: 4) 

HFnLDETop: 

5'- GG AAT TCC TAA TAG GAG TCA CTA TAG GGA CAA TTA CTA TTT 
ACA ATT ACA ATG CAT GAG CAT GAG CAT GAG CTG TTC ACA GGA 
GGA AAT AGC CCT GTC C-3' (SEQ ID NO: 5) 

HFnLDEBot-flag8:

5'- AGC GGA TGC GTT GTC GTC GTC GTC GTT GTA GTC GCT GTT GGT 
ATA ATC AAC TCC AGG TTT AAG GCC GCT GAT GGT AGC TGT (SNN)4 
AGG CAC AGT GAA CTG CTG GAC AGG GCT ATT TCC TCC TGT -3' (SEQ 
ID NO: 6)

HFnDE3'-flag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT CTT CGT 
ATA ATC AAC TCC AGG TTT AAG G-3' (SEQ ID NO: 7) 

HFnLFGTop: 

5'- GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT
ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC CTC TTC TAT ACC 
ATC ACT GTG TAT GCT GTC-3' (SEQ ID NO: 8) 

HFnLFGBot-flag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC TGT TCG GTA 
ATT AAT GGA AAT TGG (SNN)10 ACT GAC AGC ATA CAC AGT GAT
GGT ATA -3' (SEQ ID NO: 9) 

HFnFG3'-flag8: 

5'- AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC TGT TCG GTA 
ATT AAT GGA AAT TGG -3' (SEQ ID NO: 10) 

T7Tmv (introduces T7 promoter and TMV untranslated region needed for in vitro
translation): 

5'- GCG TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT 
ACA-3' (SEQ ID NO: 11) 

ASAflag8:

5'- AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC-3' (SEQ ID NO: 12)

Unisplint-S (splint oligonucleotide used to ligate mRNA to the puromycin-containing
linker, described by Roberts et al., 1997, supra):
5'-TTTTTTTTTNAGCGGATGC-3' (SEQ ID NO: 13)

A18 — 2PEG (DNA-puromycin linker):
5'-(A)18(PEG)2CCPur (SEQ ID NO: 14)

The pairs of oligonucleotides (500 pmol of each) were annealed in 100
µL of 10 mM Tris 7.5, 50 mM NaCl for 10 minutes at 85°C, followed by a slow
(0.5-1 hour) cooling to room temperature. The annealed fragments with
single-stranded overhangs were then extended using 100 U Klenow (New England
Biolabs, Beverly, MA) for each 100 µL aliquot of annealed oligos, and the buffer
made of 838.5 µl H2O, 9 µl 1 M Tris 7.5, 5 µl 1 M MgCl2, 20 µl 10 mM dNTPs,
and 7.5 µl 1 M DTT. The extension reactions proceeded for 1 hour at 25°C.
Next, each of the double-stranded fragments was transformed into an
RNA-protein fusion (PROfusion™) using the technique developed by Szostak et
al., U.S.S.N. 09/007,005 and U.S.S.N. 09/247,190; Szostak et al., WO98/31700;
and Roberts & Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p.
12297-12302. Briefly, the fragments were transcribed using an Ambion in vitro
transcription kit, MEGAshortscript (Ambion, Austin, TX), and the resulting
mRNA was gel-purified and ligated to a DNA-puromycin linker using DNA
ligase. The mRNA-DNA-puromycin molecule was then translated using the
Ambion rabbit reticulocyte lysate-based translation kit. The resulting
mRNA-DNA-puromycin-protein PROfusion™ was purified using Oligo(dT)
cellulose, and a complementary DNA strand was synthesized using reverse
transcriptase and the RT primers described above (Unisplint-S or flagASA),
following the manufacturer's instructions.
The PROfusion™ obtained for each fragment was next purified on the
resin appropriate to its peptide purification tag, i.e., on Ni-NTA agarose for the
His6-tag and M2 agarose for the FLAG-tag, following the procedure recommended
by the manufacturer. The DNA component of the tag-binding PROfusions™ was
amplified by PCR using Pharmacia Ready-to-Go PCR Beads, 10 pmol of 5' and 3'
PCR primers, and the following PCR program (Pharmacia, Piscataway, NJ): Step
1: 95°C for 3 minutes; Step 2: 95°C for 30 seconds, 58/62°C for 30 seconds, 72°C
for 1 minute, 20/25/30 cycles, as required; Step 3: 72°C for 5 minutes; Step 4:
4°C until end.
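
As a rough planning aid, the programmed hold times of the PCR program above can
be added up for each of the cycle counts mentioned. The sketch below is
illustrative only: the names are hypothetical, and it ignores instrument ramp
times and the open-ended 4°C hold, so actual run times will be longer.

```python
# Hold times in seconds, taken from the PCR program quoted above.
INITIAL_DENATURATION_S = 3 * 60   # Step 1: 95°C for 3 minutes
CYCLE_STEPS_S = (30, 30, 60)      # Step 2: 95°C, 58/62°C, 72°C
FINAL_EXTENSION_S = 5 * 60        # Step 3: 72°C for 5 minutes


def programmed_hold_minutes(cycles: int) -> float:
    """Sum of programmed hold times, ignoring ramping and the final 4°C hold."""
    total_s = (
        INITIAL_DENATURATION_S
        + cycles * sum(CYCLE_STEPS_S)
        + FINAL_EXTENSION_S
    )
    return total_s / 60


if __name__ == "__main__":
    for cycles in (20, 25, 30):
        print(cycles, "cycles:", programmed_hold_minutes(cycles), "min")
```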

The resulting DNA was cleaved by 5 U Ear I (New England Biolabs)
per 1 µg DNA; the reaction took place in T4 DNA Ligase Buffer (New England
Biolabs) at 37°C, for 1 hour, and was followed by an incubation at 70°C for 15
minutes to inactivate Ear I. Equal amounts of the BC, DE, and FG fragments were
combined and ligated to form a full-length ¹⁰Fn3 gene with randomized loops. The
ligation required 10 U of fresh Ear I (New England Biolabs) and 20 U of T4 DNA
Ligase (Promega, Madison, WI), and took 1 hour at 37°C.

Three different libraries were made in the manner described above.
Each contained the form of the FG loop with 10 randomized residues. The BC and
the DE loops of the first library bore the wild type ¹⁰Fn3 sequence; a BC loop with
7 randomized residues and a wild type DE loop made up the second library; and a
BC loop with 7 randomized residues and a DE loop with 4 randomized residues
made up the third library. The complexity of the FG loop in each of these three
libraries was 10^13; the further two randomized loops provided the potential for a
complexity too large to be sampled in a laboratory.
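
The scale of these libraries can be illustrated with a short calculation. The
sketch below assumes that each randomized position can hold any of the 20
natural amino acids, so that n randomized positions give 20^n possible protein
sequences; ten positions give about 1.0 x 10^13. The names are illustrative.

```python
AMINO_ACIDS = 20  # assumption: all 20 natural amino acids allowed per position


def theoretical_diversity(randomized_positions: int) -> int:
    """Number of distinct protein sequences for fully randomized positions."""
    return AMINO_ACIDS ** randomized_positions


# Randomized positions per library, from the loop lengths given above.
LIBRARIES = {
    "FG (10)": 10,
    "BC (7) + FG (10)": 17,
    "BC (7) + DE (4) + FG (10)": 21,
}

if __name__ == "__main__":
    for name, positions in LIBRARIES.items():
        print(f"{name}: {theoretical_diversity(positions):.2e}")
```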

The three libraries constructed were combined into one master library in
order to simplify the selection process; target binding itself was expected to select
the most suitable library for a particular challenge. PROfusions™ were obtained
from the master library following the general procedure described in Szostak et al.,
U.S.S.N. 09/007,005 and 09/247,190; Szostak et al., WO98/31700; and Roberts &
Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302 (Figure 8).



Fusion Selections

The master library in the PROfusion™ form was subjected to selection
for binding to TNF-α. Two protocols were employed: one in which the target was
immobilized on an agarose column and one in which the target was immobilized
on a BIACORE chip. First, an extensive optimization of conditions to minimize
background binders to the agarose column yielded the favorable buffer conditions
of 50 mM HEPES pH 7.4, 0.02% Triton, 100 µg/ml Sheared Salmon Sperm DNA.
In this buffer, the non-specific binding of the ¹⁰Fn3 RNA fusion to TNF-α
Sepharose was 0.3%. The non-specific binding background of the ¹⁰Fn3
RNA-DNA to TNF-α Sepharose was found to be 0.1%.

During each round of selection on TNF-α Sepharose, the PROfusion™
library was first preincubated for an hour with underivatized Sepharose to remove
any remaining non-specific binders; the flow-through from this pre-clearing was
incubated for another hour with TNF-α Sepharose. The TNF-α Sepharose was
washed for 3-30 minutes.

After each selection, the PROfusion™ DNA that had been eluted from
the solid support with 0.3 M NaOH or 0.1 M KOH was amplified by PCR; a DNA
band of the expected size persisted through multiple rounds of selection (Figure 9);
similar results were observed in the two alternative selection protocols, and only
the data from the agarose column selection is shown in Figure 9.

In the first seven rounds, the binding of library PROfusions™ to the 
target remained low; in contrast, when free protein was translated from DNA pools 
at different stages of the selection, the proportion of the column binding species 
increased significantly between rounds (Figure 10). Similar selections may be 
carried out with any other binding species target (for example, IL-1 and IL-13). 

Animal Studies 

Wild-type ¹⁰Fn3 contains an integrin-binding tripeptide motif, Arginine
78 - Glycine 79 - Aspartate 80 (the "RGD motif") at the tip of the FG loop. In
order to avoid integrin binding and a potential inflammatory response based on this
tripeptide in vivo, a mutant form of ¹⁰Fn3 was generated that contained an inert
sequence, Serine 78 - Glycine 79 - Glutamate 80 (the "SGE mutant"), a sequence
which is found in the closely related, wild-type ¹¹Fn3 domain. This SGE mutant
was expressed as an N-terminally His6-tagged, free protein in E. coli, and purified
to homogeneity on a metal chelate column followed by a size exclusion column.

In particular, the DNA sequence encoding His6-¹⁰Fn3(SGE) was cloned
into the pET9a expression vector and transformed into BL21 DE3 pLysS cells.
The culture was then grown in LB broth containing 50 µg/mL kanamycin at 37°C,
with shaking, to A560 = 1.0, and was then induced with 0.4 mM IPTG. The induced
culture was further incubated, under the same conditions, overnight (14-18 hours);
the bacteria were recovered by standard, low speed centrifugation. The cell pellet
was resuspended in 1/50 of the original culture volume of lysis buffer (50 mM Tris
8.0, 0.5 M NaCl, 5% glycerol, 0.05% Triton X-100, and 1 mM PMSF), and the
cells were lysed by passing the resulting paste through a Microfluidics Corporation
Microfluidizer M110-EH, three times. The lysate was clarified by centrifugation,
and the supernatant was filtered through a 0.45 µm filter followed by filtration
through a 0.2 µm filter. 100 mL of the clarified lysate was loaded onto a 5 mL
Talon cobalt column (Clontech, Palo Alto, CA), washed with 70 mL of lysis buffer,
and eluted with a linear gradient of 0-30 mM imidazole in lysis buffer. The flow
rate through the column through all the steps was 1 mL/min. The eluted protein
was concentrated 10-fold by dialysis (MW cutoff = 3,500) against 15,000-20,000
PEG. The resulting sample was dialysed into buffer 1 (lysis buffer without the
glycerol), then loaded, 5 mL at a time, onto a 16 x 60 mm Sephacryl 100 size
exclusion column equilibrated in buffer 1. The column was run at 0.8 mL/min, in
buffer 1; all fractions that contained a protein of the expected MW were pooled,
concentrated 10X as described above, then dialyzed into PBS. Toxikon (MA) was
engaged to perform endotoxin screens and animal studies on the resulting sample.

In these animal studies, the endotoxin levels in the samples examined to
date have been below the detection level of the assay. In a preliminary toxicology
study, this protein was injected into two mice at the estimated 100X therapeutic
dose of 2.6 mg/mouse. The animals survived the two weeks of the study with no
apparent ill effects. These results suggest that ¹⁰Fn3 may be incorporated safely
into an IV drug.

Alternative Constructs for In Vivo Use

To extend the half-life of the 8 kD ¹⁰Fn3 domain, a larger molecule has
also been constructed that mimics natural antibodies. This ¹⁰Fn3-Fc molecule
contains the -CH1-CH2-CH3 (Figure 11) or -CH2-CH3 domains of the IgG constant
region of the host; in these constructs, the ¹⁰Fn3 domain is grafted onto the
N-terminus in place of the IgG domain (Figures 11 and 12). Such
antibody-like constructs are expected to improve the pharmacokinetics of the
protein as well as its ability to harness the natural immune response.

In order to construct the murine form of the ¹⁰Fn3-CH1-CH2-CH3 clone,
the -CH1-CH2-CH3 region was first amplified from a mouse liver spleen cDNA
library (Clontech), then ligated into the pET25b vector. The primers used in the
cloning were 5' Fc Nest and 3' Fc Nest, and the primers used to graft the
appropriate restriction sites onto the ends of the recovered insert were 5' Fc HIII
and 3' Fc Nhe:

5' Fc Nest 5' GCG GCA GGG TTT GCT TAG TGG GGC CAA GGG 3' (SEQ ID
NO: 15);

3' Fc Nest 5' GGG AGG GGT GGA GGT AGG TCA GAG TCC 3' (SEQ ID NO:
16);

3' Fc Nhe 5' TTT GCT AGC TTT ACC AGG AGA GTG GGA GGC 3' (SEQ ID
NO: 17); and

5' Fc HIII 5' AAA AAG CTT GCC AAA ACG ACA CCC CCA TCT GTC 3'
(SEQ ID NO: 18).

Further PCR is used to remove the CH1 region from this clone and
create the Fc part of the shorter, ¹⁰Fn3-CH2-CH3 clone. The sequence encoding
¹⁰Fn3 is spliced onto the 5' end of each clone; either the wild type ¹⁰Fn3 cloned
from the same mouse spleen cDNA library or a modified ¹⁰Fn3 obtained by
mutagenesis or randomization of the molecules can be used. The oligonucleotides
used in the cloning of murine wild-type ¹⁰Fn3 were:

Mo5PCR-NdeI:

5' CATATGGTTTCTGATATTCCGAGAGATCTGGAG 3' (SEQ ID NO: 19);

Mo5PCR-His-NdeI (for an alternative N-terminus with the His6
purification tag):

5' CAT ATG CAT CAC CAT CAC CAT CAC GTT TCT GAT ATT
CCG AGA G 3' (SEQ ID NO: 20); and

Mo3PCR-EcoRI: 5' GAATTCCTATGTTTTATAATTGATGGAAAC 3'
(SEQ ID NO: 21).

The human equivalents of the clones are constructed using the same 
strategy with human oligonucleotide sequences. 

Fn3 Scaffolds in Protein Chip Applications

The suitability of the ¹⁰Fn3 scaffold for protein chip applications is the
consequence of (1) its ability to support many binding functions which can be
selected rapidly on the bench or in an automated setup, and (2) its superior
biophysical properties.

The versatile binding properties of ¹⁰Fn3 are a function of the loops
displayed by the Fn3 immunoglobulin-like, beta sandwich fold. As discussed
above, these loops are similar to the complementarity determining regions of
antibody variable domains and can cooperate in a way similar to those antibody
loops in order to bind antigens. In our system, ¹⁰Fn3 loops BC (residues 21-30),
DE (residues 51-56), and FG (residues 76-87) are randomized either in sequence,
in length, or in both sequence and length in order to generate diverse libraries of
mRNA-¹⁰Fn3 fusions. The binders in such libraries are then enriched based on
their affinity for an immobilized or tagged target, until a small population of high
affinity binders is generated. Also, error-prone PCR and recombination can be
employed to facilitate affinity maturation of selected binders. Due to the rapid and
efficient selection and affinity maturation protocols, binders to a large number of
targets can be selected in a short time.

As a scaffold for binders to be immobilized on protein chips, the ¹⁰Fn3
domain has the advantage over antibody fragments and single-chain antibodies of
being smaller and easier to handle. For example, unlike single-chain scaffolds or
isolated variable domains of antibodies, which vary widely in their stability and
solubility, and which require an oxidizing environment to preserve their
structurally essential disulfide bonds, ¹⁰Fn3 is extremely stable, with a melting
temperature of 110°C, and solubility at a concentration > 16 mg/mL. The ¹⁰Fn3
scaffold also contains no disulfides or free cysteines; consequently, it is insensitive
to the redox potential of its environment. A further advantage of ¹⁰Fn3 is that its
antigen-binding loops and N-terminus are on the edge of the beta-sandwich
opposite to the C-terminus; thus the attachment of a ¹⁰Fn3 scaffold to a chip by its
C-terminus aligns the antigen-binding loops, allowing for their greatest
accessibility to the solution being assayed. Since ¹⁰Fn3 is a single domain of only
94 amino acid residues, it is also possible to immobilize it onto a chip surface at a
higher density than is used for single-chain antibodies, with their approximately
250 residues. In addition, the hydrophilicity of the ¹⁰Fn3 scaffold, which is
reflected in the high solubility of this domain, leads to a lower than average
background binding of ¹⁰Fn3 to a chip surface.

The stability of the ¹⁰Fn3 scaffold as well as its suitability for library
formation and selection of binders are likely to be shared by the large, Fn3-like
class of protein domains with an immunoglobulin-like fold, such as the domains of
tenascin, N-cadherin, E-cadherin, ICAM, titin, GCSF-R, cytokine receptor,
glycosidase inhibitor, and antibiotic chromoprotein. The key feature shared by all
such domains is a stable framework provided by two beta-sheets, which are
packed against each other and which are connected by at least three
solvent-accessible loops per edge of the sheet; such loops can be randomized to
generate a library of potential binders without disrupting the structure of the
framework (as described above).

Immobilization of Fibronectin Scaffold Binders ("Fn-binders")

To immobilize Fn-binders to a chip surface, a number of exemplary
techniques may be utilized. For example, Fn-binders may be immobilized as
RNA-protein fusions by Watson-Crick hybridization of the RNA moiety of the
fusion to a base complementary DNA immobilized on the chip surface (as
described, for example, in Addressable Protein Arrays, U.S.S.N. 60/080,686;
U.S.S.N. 09/282,734; and WO 99/51773). Alternatively, Fn-binders can be
immobilized as free proteins directly on a chip surface. Manual as well as robotic
devices may be used for deposition of the Fn-binders on the chip surface. Spotting
robots can be used for deposition of Fn-binders with high density in an array
format (for example, by the method of Lucking et al., Anal. Biochem. 1999 May
15;270(1):103-11). Different methods may also be utilized for anchoring the
Fn-binder on the chip surface. A number of standard immobilization procedures
may be used including those described in Methods in Enzymology (K. Mosbach
and B. Danielsson, eds.), vols. 135 and 136, Academic Press, Orlando, Florida,
1987; Nilsson et al., Protein Expr. Purif. 1997 Oct;11(1):1-16; and references
therein. Oriented immobilization of Fn-binders can help to increase the binding
capacity of chip-bound Fn-binders. Exemplary approaches for achieving oriented
coupling are described in Lu et al., The Analyst (1996), vol. 121, p. 29R-32R; and
Turkova, J Chromatogr B Biomed Sci App. 1999 Feb 5;722(1-2):11-31. In
addition, any of the methods described herein for anchoring Fn-binders to chip
surfaces can also be applied to the immobilization of Fn-binders on beads, or other
supports.

Target Protein Capture and Detection 

Selected populations of Fn-binders may be used for detection and/or
quantitation of analyte targets, for example, in samples such as biological samples.
To carry out this type of diagnostic assay, selected Fn-binders to targets of interest
are immobilized on an appropriate support to form multi-featured protein chips.
Next, a sample is applied to the chip, and the components of the sample that
associate with the Fn-binders are identified based on the target-specificity of the
immobilized binders. Using this technique, one or more components may be
simultaneously identified or quantitated in a sample (for example, as a means to
carry out sample profiling).

Methods for target detection allow measuring the levels of bound
protein targets and include, without limitation, radiography, fluorescence scanning,
mass spectroscopy (MS), and surface plasmon resonance (SPR). Autoradiography
using a phosphorimager system (Molecular Dynamics, Sunnyvale, CA) can be
used for detection and quantification of target protein which has been radioactively
labeled, e.g., using ³⁵S methionine. Fluorescence scanning using a laser scanner
(see below) may be used for detection and quantification of fluorescently labeled
targets. Alternatively, fluorescence scanning may be used for the detection of
fluorescently labeled ligands which themselves bind to the target protein (e.g.,
fluorescently labeled target-specific antibodies or fluorescently labeled
streptavidin binding to target-biotin, as described below).

Mass spectroscopy can be used to detect and identify bound targets
based on their molecular mass. Desorption of bound target protein can be
achieved with laser assistance directly from the chip surface as described below.
Mass detection also allows determinations, based on molecular mass, of target
modifications including post-translational modifications like phosphorylation or
glycosylation. Surface plasmon resonance can be used for quantification of bound
protein targets where the Fn-binder(s) are immobilized on a suitable gold-surface
(for example, as obtained from Biacore, Sweden).
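
The idea of inferring a modification from a mass difference can be sketched as
follows. This is a hypothetical illustration, not a procedure from this
document: the mass additions are standard approximate monoisotopic values, and
the function name, the tolerance, and the example masses are assumptions chosen
only to show the comparison.

```python
# Approximate monoisotopic mass additions in daltons (standard reference
# values, not taken from this document).
MODIFICATION_MASS_DA = {
    "phosphorylation (HPO3)": 79.96633,
    "hexose (C6H10O5)": 162.05282,
}


def candidate_modifications(expected_da, observed_da, tolerance_da=0.5):
    """List modifications whose mass addition matches the observed shift."""
    shift = observed_da - expected_da
    return [
        name
        for name, mass in MODIFICATION_MASS_DA.items()
        if abs(shift - mass) <= tolerance_da
    ]


if __name__ == "__main__":
    # Hypothetical masses chosen for illustration only.
    print(candidate_modifications(10000.0, 10079.97))
```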

Described below are exemplary schemes for selecting Fn-binders (in this
case, Fn-binders specific for the protein, TNF-α) and the use of those selected
populations for detection on chips. This example is provided for the purpose of
illustrating the invention, and should not be construed as limiting.

Selection of TNF-α Binders Based on ¹⁰Fn3 Scaffold

In one exemplary use for fibronectin scaffold selection on chips, a
¹⁰Fn3-based selection was performed against TNF-α, using a library of human
¹⁰Fn3 variants with randomized loops BC, DE, and FG. The library was
constructed from three DNA fragments, each of which contained nucleotide
sequences that encoded approximately one third of human ¹⁰Fn3, including one of
the randomized loops. The DNA sequences that encoded the loop residues listed
above were rebuilt by oligonucleotide synthesis, so that the codons for the residues
of interest were replaced by (NNS)n, where N represents any of the four
deoxyribonucleotides (A, T, C, or G), and S represents either C or G. The
C-terminus of each fragment contained the sequence for the FLAG purification
tag.
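
One consequence of the NNS scheme can be checked by enumeration: restricting
the third base to C or G leaves 32 codons that still encode all 20 amino acids
while including only one stop codon (TAG). The Python sketch below uses the
standard genetic code; the variable and function names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# NNS: any base at positions 1 and 2, C or G at position 3.
NNS_CODONS = ["".join(c) for c in product("ACGT", "ACGT", "CG")]


def summarize_nns():
    """Return (codon count, sorted encoded amino acids, stop codons) for NNS."""
    stops = [c for c in NNS_CODONS if CODON_TABLE[c] == "*"]
    encoded = {CODON_TABLE[c] for c in NNS_CODONS} - {"*"}
    return len(NNS_CODONS), sorted(encoded), stops


if __name__ == "__main__":
    count, encoded, stops = summarize_nns()
    print(count, "codons;", len(encoded), "amino acids; stops:", stops)
```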

Once extended by Klenow, each DNA fragment was transcribed, ligated
to a puromycin-containing DNA linker, and translated in vitro, as described by
Szostak et al. (Roberts and Szostak, Proc. Natl. Acad. Sci. USA 94:12297, 1997;
Szostak et al., U.S.S.N. 09/007,005 and U.S.S.N. 09/247,190; Szostak et al.,
WO98/31700), to generate an mRNA-peptide fusion, which was then
reverse-transcribed into a DNA-mRNA-peptide fusion. The binding of the
FLAG-tagged peptide to M2 agarose separated full-length fusion molecules from
those containing frameshifts or superfluous stop codons; the DNA associated with
the purified full-length fusion was amplified by PCR, then the three DNA
fragments were cut by Ear I restriction endonuclease and ligated to form the
full-length template. The template was transcribed, ligated to puromycin-containing
DNA linkers, and translated to generate a ¹⁰Fn3-PROfusion™ library, which was
then reverse-transcribed to yield the DNA-mRNA-peptide fusion library which
was subsequently used in the selection.

Selection for TNF-α binders took place in 50 mM HEPES, pH 7.4, 
0.02% Triton-X, 0.1 mg/mL salmon sperm DNA. The PROfusion™ library was 
incubated with Sepharose-immobilized TNF-α; after washing, the DNA associated 
with the tightest binders was eluted with 0.1 M KOH, amplified by PCR, and 
transcribed, ligated, translated, and reverse-transcribed into the starting material 
for the next round of selection. 

Ten rounds of such selection were performed (as shown in Figure 13); 
they resulted in a PROfusion™ pool that bound to TNF-α-Sepharose with an 
apparent average Kd of 120 nM. Specific clonal components of the pool that were 
characterized showed TNF-α binding in the range of 50-500 nM. 
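
To give a feel for what the reported dissociation constants imply, the sketch below computes the equilibrium fraction of binder occupied by target at a given target concentration. It assumes simple 1:1 binding with target in excess, which is an illustrative simplification and not a model stated in this description; the function name and the 100 nM example concentration are likewise assumptions.

```python
def fraction_bound(target_nM: float, kd_nM: float) -> float:
    """Equilibrium fraction of binder occupied, assuming 1:1 binding
    and target in excess: [T] / ([T] + Kd)."""
    if target_nM < 0 or kd_nM <= 0:
        raise ValueError("target concentration must be >= 0 and Kd must be > 0")
    return target_nM / (target_nM + kd_nM)


if __name__ == "__main__":
    # Pool average (120 nM) and the two ends of the clonal range (50 and 500 nM).
    for kd in (50.0, 120.0, 500.0):
        print(f"Kd = {kd:5.0f} nM -> fraction bound at 100 nM target: "
              f"{fraction_bound(100.0, kd):.2f}")
```

Under these assumptions, a binder with a Kd equal to the target concentration is half occupied, so the tenfold spread in clonal Kd values corresponds to a substantial spread in expected occupancy at any single target concentration.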

Fn-binder Immobilization, Target Protein Capture, and MALDI-TOF Detection 

As a first step toward immobilizing the Fn-binders to a chip surface, an 
oligonucleotide capture probe was prepared with an automated DNA synthesizer 
(PE BioSystems Expedite 8909) using the solid-support phosphoramidite 
approach. All reagents were obtained from Glen Research. Synthesis was 
initiated with a solid support containing a disulfide bond to eventually provide a 
3'-terminal thiol functionality. The first four monomers to be added were 
hexaethylene oxide units, followed by 20 T monomers. The 5'-terminal DMT 
group was not removed. The capture probe was cleaved from the solid support 
and deprotected with ammonium hydroxide, concentrated to dryness in a vacuum 
centrifuge, and purified by reverse-phase HPLC using an acetonitrile gradient in 
triethylammonium acetate buffer. Appropriate fractions from the HPLC were 
collected, evaporated to dryness in a vacuum centrifuge, and the 5'-terminal DMT 
group was removed by treatment with 80% AcOH for 30 minutes. The acid was 
removed by evaporation, and the oligonucleotide was then treated with 100 mM 
DTT for 30 minutes to cleave the disulfide bond. DTT was removed by repeated 
extraction with EtOAc. The oligonucleotide was ethanol precipitated from the 
remaining aqueous layer and checked for purity by reverse-phase HPLC. 

The 3'-thiol capture probe was adjusted to 250 µM in degassed 1X PBS 
buffer and applied as a single droplet (75 µL) to a 9x9 mm gold-coated chip 
(Biacore) in an argon-flushed chamber containing a small amount of water. After 
18 hours at room temperature, the capture probe solution was removed, and the 
functionalized chip was washed with 50 mL 1X PBS buffer (2x for 15 minutes 
each) with gentle agitation, and then rinsed with 50 mL water (2x for 15 minutes 
each) in the same fashion. Remaining liquid was carefully removed and the 
functionalized chips were either used immediately or stored at 4°C under argon. 

About 1 pmol of ¹⁰Fn3 fusion pool from the Round 10 TNF-α selection 
(above) was treated with RNase A for several hours, adjusted to 5X SSC in 70 µL, 
and applied to a functionalized gold chip from above as a single droplet. A 50 µL 
volume gasket device was used to seal the fusion mixture with the functionalized 
chip, and the apparatus was continuously rotated at 4°C. After 18 hours the 
apparatus was disassembled, and the gold chip was washed with 50 mL 5X SSC 
for 10 minutes with gentle agitation. Excess liquid was carefully removed from 
the chip surface, and the chip was passivated with a blocking solution (1X TBS + 
0.02% Tween-20 + 0.25% BSA) for 10 minutes at 4°C. Excess liquid was 
carefully removed, and a solution containing 500 µg/mL TNF-α in the same 
composition blocking solution was applied to the chip as a single droplet and 
incubated at 4°C for two hours with occasional mixing of the droplet via 
Pipetman. After removal of the binding solution, the chip was washed for 5 
minutes at 4°C with gentle agitation (50 mL 1X TBS + 0.02% Tween-20) and then 
dried at room temperature. A second chip was prepared exactly as described 
above, except fusion was not added to the hybridization mix. 

Next, MALDI-TOF matrix (15 mg/mL 
3,5-dimethoxy-4-hydroxycinnamic acid in 1:1 ethanol/10% formic acid in water) 
was uniformly applied to the gold chips with a high-precision 3-axis robot 
(MicroGrid, BioRobotics). A 16-pin tool was used to transfer the matrix from a 
384-well microtiter plate to the chips, producing 200 micron diameter features 
with a 600 micron pitch. The MALDI-TOF mass spectrometer (Voyager DE, 
PerSeptive Biosystems) instrument settings were as follows: Accelerating Voltage 
= 25k, Grid Voltage = 92%, Guide Wire Voltage = 0.05%, Delay = 200 on, Laser 
Power = 2400, Low Mass Gate = 1500, Negative Ions = off. The gold chips were 
individually placed on a MALDI sample stage modified to keep the level of the 
chip the same as the level of the stage, thus allowing proper flight distance. The 
instrument's video monitor and motion control system were used to direct the laser 
beam to individual matrix features. 

Figures 14 and 15 show the mass spectra from the ¹⁰Fn3 fusion chip and 
the non-fusion chip, respectively. In each case, a small number of 200 micron 
features were analyzed to collect the spectra, but Figure 15 required significantly 
more acquisitions. The signal at 17.5 kDa corresponds to TNF-α monomer. 
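
As a rough aid to reading such spectra, the sketch below estimates where protonated ions of a protein of a given mass would appear on the mass axis. It assumes positive-ion detection of [M + zH]z+ species and takes 17,500 Da as the approximate monomer mass quoted above; the function name and the choice of charge states are illustrative assumptions only.

```python
PROTON_MASS_DA = 1.00728  # mass of a proton in daltons


def protonated_mz(neutral_mass_da: float, charge: int) -> float:
    """Return m/z for an ion carrying `charge` protons: (M + z * m_H) / z."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return (neutral_mass_da + charge * PROTON_MASS_DA) / charge


if __name__ == "__main__":
    approx_monomer_da = 17_500.0  # approximate value taken from the text
    for z in (1, 2):
        print(f"z = {z}: m/z = {protonated_mz(approx_monomer_da, z):.1f}")
```

Both the singly and doubly protonated values computed this way lie above the Low Mass Gate setting of 1500 listed in the instrument settings.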

Fn-binder Immobilization, Target Protein Capture, and Fluorescence Detection 
Pre-cleaned 1x3 inch glass microscope slides (Goldseal, #3010) were 
treated with Nanostrip (Cyantek) for 15 minutes, 10% aqueous NaOH at 70°C for 
3 minutes, and 1% aqueous HCl for 1 minute, thoroughly rinsing with deionized 
water after each reagent. The slides were then dried in a vacuum desiccator over 
anhydrous calcium sulfate for several hours. A 1% solution of 
aminopropyltrimethoxysilane in 95% acetone / 5% water was prepared and allowed 
to hydrolyze for 20 minutes. The glass slides were immersed in the hydrolyzed 
silane solution for 5 minutes with gentle agitation. Excess silane was removed by 
subjecting the slides to ten 5-minute washes, using fresh portions of 95% acetone / 
5% water for each wash, with gentle agitation. The slides were then cured by 
heating at 110°C for 20 minutes. The silane-treated slides were immersed in a 
freshly prepared 0.2% solution of phenylene 1,4-diisothiocyanate in 90% DMF / 
10% pyridine for two hours, with gentle agitation. The slides were washed 
sequentially with 90% DMF / 10% pyridine, methanol, and acetone. After air 
drying, the functionalized slides were stored at 0°C in a vacuum desiccator over 
anhydrous calcium sulfate. Similar results were obtained with commercial 
amine-reactive slides (3-D Link, Surmodics). 

Oligonucleotide capture probes were prepared with an automated DNA 
synthesizer (PE BioSystems Expedite 8909) using conventional phosphoramidite 
chemistry. All reagents were from Glen Research. Synthesis was initiated with a 
solid support bearing an orthogonally protected amino functionality, whereby the 
3'-terminal amine is not unmasked until the final deprotection step. The first four 
monomers to be added were hexaethylene oxide units, followed by the standard A, 
G, C and T monomers. All capture oligo sequences were cleaved from the solid 
support and deprotected with ammonium hydroxide, concentrated to dryness, 
precipitated in ethanol, and purified by reverse-phase HPLC using an acetonitrile 
gradient in triethylammonium acetate buffer. Appropriate fractions from the 
HPLC were collected, evaporated to dryness in a vacuum centrifuge, and then 
coevaporated with a portion of water. 

The purified, amine-labeled capture oligos were adjusted to a 
concentration of 250 µM in 50 mM sodium carbonate buffer (pH 9.0) containing 
10% glycerol. The probes were spotted onto the amine-reactive glass surface at 
defined positions in a 5x5x6 array pattern with a 3-axis robot (MicroGrid, 
BioRobotics). A 16-pin tool was used to transfer the liquid from 384-well 
microtiter plates, producing 200 micron features with a 600 micron pitch. Each 
sub-grid of 24 features represents a single capture probe (i.e., 24 duplicate spots). 
The arrays were incubated at room temperature in a moisture-saturated 
environment for 12-18 hours. The attachment reaction was terminated by 
immersing the chips in 2% aqueous ammonium hydroxide for five minutes with 
gentle agitation, followed by rinsing with distilled water (3X for 5 minutes each). 
The array was finally soaked in 10X PBS solution for 30 minutes at room 
temperature, and then rinsed again for 5 minutes in distilled water. 

Specific and thermodynamically isoenergetic sequences along the ¹⁰Fn3 
mRNA were identified to serve as capture points to self-assemble and anchor the 
¹⁰Fn3 protein. The software program HybSimulator v4.0 (Advanced Gene 
Computing Technology, Inc.) facilitated the identification and analysis of potential 
capture probes. Six unique capture probes were chosen and printed onto the chip, 
three of which are complementary to common regions of the ¹⁰Fn3 fusion pool's 
mRNA (CP3', CP5', and CPflag). The remaining three sequences (CPneg1, 
CPneg2, and CPneg3) are not complementary and function in part as negative 
controls. Each of the capture probes possesses a 3'-amino terminus and four 
hexaethylene oxide spacer units, as described above. The following is a list of the 
capture probe sequences that were employed (5'→3'): 



CP3': TGTAAATAGTAATTGTCCC (SEQ ID NO: 22) 

CP5': [sequence illegible in source] (SEQ ID NO: 23) 

CPneg1: CCTGTAGGTGTCCAT (SEQ ID NO: 24) 

CPflag: CATCGTCCTTGTAGTC (SEQ ID NO: 25) 

CPneg2: CGTCGTAGGGGTA (SEQ ID NO: 26) 

CPneg3: CAGGTCTTCTTCAGAGA (SEQ ID NO: 27) 
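
Whether a capture probe is complementary to a given region of the fusion mRNA can be checked by reverse-complementing the probe. The sketch below does this for the CPflag sequence listed above and translates the result using the standard genetic code. The helper names are hypothetical, and the choice of reading frame (starting at the first base of the reverse complement) is an assumption made only for illustration.

```python
from itertools import product

# Standard genetic code, indexed by base order T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for (i, a), (j, b), (k, c) in product(enumerate(BASES), repeat=3)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from the first base; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    cp_flag = "CATCGTCCTTGTAGTC"  # CPflag (SEQ ID NO: 25), as listed above
    target = reverse_complement(cp_flag)
    print("target (sense strand, DNA alphabet):", target)
    print("encoded peptide fragment:", translate(target))
```

In this frame the target region encodes Asp-Tyr-Lys-Asp-Asp, which matches the start of the commonly used FLAG peptide (DYKDDDDK) and is consistent with the probe's name.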



About 1 pmol of ¹⁰Fn3 fusion pool from the Round 10 TNF-α selection was 
adjusted to 5X SSC containing 0.02% Tween-20 and 2 mM vanadyl ribonucleotide 
complex in a total volume of 350 µL. The entire volume was applied to the 
microarray under a 400 µL gasket device and the assembly was continuously 
rotated for 18 hours at room temperature. After hybridization the slide was 
washed sequentially with stirred 500 mL portions of 5X SSC, 2.5X SSC, and 1X 
SSC for 5 minutes each. Traces of liquid were removed by centrifugation and the 
slide was allowed to air-dry. 

Recombinant human TNF-α (500 µg, lyophilized, from PreproTech) 
was taken up in 230 µL 1X PBS and dialyzed against 700 mL stirred 1X PBS at 
4°C for 18 hours in a Microdialyzer unit (3,500 MWCO, Pierce). The dialyzed 
TNF-α was treated with EZ-Link NHS-LC-LC biotinylation reagent (20 µg, 
Pierce) for 2 hours at 0°C, and again dialyzed against 700 mL stirred 1X PBS at 
4°C for 18 hours in a Microdialyzer unit (3,500 MWCO, Pierce). The resulting 
conjugate was analyzed by MALDI-TOF mass spectrometry and was found to be 
almost completely functionalized with a single biotin moiety. 

Each of the following processes was conducted at 4°C with continuous 
rotation or mixing. The protein microarray surface was passivated by treatment 
with 1X TBS containing 0.02% Tween-20 and 0.2% BSA (200 µL) for 60 
minutes. Biotinylated TNF-α (100 nM concentration made up in the passivation 
buffer) was contacted with the microarray for 120 minutes. The microarray was 
washed with 1X TBS containing 0.02% Tween-20 (3X 50 mL, 5 minutes each 
wash). Fluorescently labeled streptavidin (2.5 µg/mL Alexa 546-streptavidin 
conjugate from Molecular Probes, made up in the passivation buffer) was 
contacted with the microarray for 60 minutes. The microarray was washed with 
1X TBS containing 0.02% Tween-20 (2X 50 mL, 5 minutes each wash) followed 
by a 3 minute rinse with 1X TBS. Traces of liquid were removed by 
centrifugation, and the slide was allowed to air-dry at room temperature. 

Fluorescence laser scanning was performed with a GSI Lumonics 
ScanArray 5000 system using 10 µm pixel resolution and preset excitation and 
emission wavelengths for Alexa 546 dye. Phosphorimage analysis was performed 
with a Molecular Dynamics Storm system. Exposure time was 48 hours with 
direct contact between the microarray and the phosphor storage screen. 
Phosphorimage scanning was performed at the 50 µm resolution setting, and data 
were extracted with ImageQuant v.4.3 software. 

Figures 16 and 17 are the phosphorimage and fluorescence scan, 
respectively, of the same array. The phosphorimage shows where the ¹⁰Fn3 fusion 
hybridized based on the ³⁵S-methionine signal. The fluorescence scan shows 
where the labeled TNF-α bound. 

Other Embodiments 
Other embodiments are within the claims. 

All publications, patents, and patent applications mentioned herein are 
hereby incorporated by reference. 
Claims 



1. A protein comprising a fibronectin type III domain having at least 
one randomized loop, said protein being characterized by its ability to bind to a 
compound that is not bound by the corresponding naturally-occurring fibronectin. 

2. The protein of claim 1, wherein said fibronectin type III domain is a 
mammalian fibronectin type III domain. 

3. The protein of claim 2, wherein said fibronectin type III domain is a 
human fibronectin type III domain. 

4. The protein of claim 1, wherein said protein comprises the tenth 
module of said fibronectin type III domain (¹⁰Fn3). 

5. The protein of claim 4, wherein said compound binding is mediated 
by one ¹⁰Fn3 loop. 

6. The protein of claim 4, wherein said compound binding is mediated 
by two ¹⁰Fn3 loops. 

7. The protein of claim 4, wherein said compound binding is mediated 
by three ¹⁰Fn3 loops. 

8. The protein of claim 4, wherein the second loop of said ¹⁰Fn3 is 
extended in length relative to the naturally-occurring module. 

9. The protein of claim 4, wherein said ¹⁰Fn3 lacks an integrin-binding 
motif. 

10. The protein of claim 9, wherein said integrin-binding motif is 
replaced by an amino acid sequence comprising a basic amino acid-neutral amino 
acid-acidic amino acid motif. 

11. The protein of claim 10, wherein said integrin-binding motif is 
replaced by an amino acid sequence comprising serine-glycine-glutamate. 

12. The protein of claim 1, wherein said protein lacks disulfide bonds. 

13. The protein of claim 1, wherein said protein is part of a fusion 
protein. 

14. The protein of claim 13, wherein said fusion protein further 
comprises an immunoglobulin domain. 

15. The protein of claim 13, wherein said fusion protein further 
comprises a complement protein. 

16. The protein of claim 13, wherein said fusion protein further 
comprises a toxin protein. 
17. The protein of claim 13, wherein said fusion protein further 
comprises an albumin protein. 

18. The protein of claim 1, wherein said protein is covalently bound to a 
nucleic acid. 

19. The protein of claim 18, wherein said nucleic acid encodes said 
protein. 

20. The protein of claim 18, wherein said nucleic acid is RNA. 

21. The protein of claim 1 or 18, wherein said protein is immobilized on 
a solid support. 

22. The protein of claim 21, wherein said protein is immobilized on 
said solid support as part of an array. 

23. The protein of claim 21, wherein said solid support is a chip or 
bead. 

24. The protein of claim 1, wherein said protein is a multimer. 

25. The protein of claim 1 or 9, wherein said protein is formulated in a 
physiologically-acceptable carrier. 
26. A nucleic acid encoding the protein of claim 1 or 4. 



27. The nucleic acid of claim 26, wherein said nucleic acid is DNA. 



28. The nucleic acid of claim 26, wherein said nucleic acid is RNA. 



29. A method for generating a protein comprising a fibronectin type III 
domain which is pharmaceutically acceptable to a mammal, said method 
comprising removing an integrin-binding domain from said fibronectin type III 
domain. 



30. The method of claim 29, wherein said integrin binding motif is 
replaced by an amino acid sequence comprising a basic amino acid-neutral amino 
acid-acidic amino acid motif. 



31. The protein of claim 30, wherein said integrin-binding motif is 
replaced by an amino acid sequence comprising serine-glycine-glutamate. 



32. The method of claim 29, wherein said at least one loop of said 
fibronectin type III domain is randomized. 



33. The method of claim 29, wherein said protein comprises the tenth 
module of said fibronectin type III domain. 



34. The protein of claim 29, wherein said protein is part of a fusion 
protein. 



35. The protein of claim 34, wherein said fusion protein further 
comprises an immunoglobulin domain. 

36. The protein of claim 34, wherein said fusion protein further 
comprises a complement protein. 

37. The protein of claim 34, wherein said fusion protein further 
comprises a toxin protein. 

38. The protein of claim 34, wherein said fusion protein further 
comprises an albumin protein. 

39. The method of claim 29, wherein said mammal is a human. 

40. A method for obtaining a protein which binds to a compound, said 
method comprising: 

(a) contacting said compound with a candidate protein, said candidate 
protein comprising a fibronectin type III domain having at least one randomized 
loop, said contacting being carried out under conditions that allow compound- 
protein complex formation; and 

(b) obtaining, from said complex, said protein which binds to said 
compound. 
41. A method for obtaining a compound which binds to a protein, said 
protein comprising a fibronectin type III domain having at least one randomized 
loop, said method comprising: 

(a) contacting said protein with a candidate compound, said contacting 
being carried out under conditions that allow compound-protein complex 
formation; and 

(b) obtaining, from said complex, said compound which binds to said 
protein. 



42. The method of claim 40, said method further comprising 
randomizing at least one loop of said fibronectin type III domain of said protein 
obtained in step (b) and repeating said steps (a) and (b) using said further 
randomized protein. 



43. The method of claim 41, said method further comprising modifying 
said compound obtained in step (b) and repeating said steps (a) and (b) using said 
further modified compound. 

44. The method of claim 40 or 41, wherein said compound is a protein. 

45. The method of claim 40 or 41, wherein said fibronectin type III 
domain is a mammalian fibronectin type III domain. 



46. The method of claim 45, wherein said fibronectin type III domain is 
a human fibronectin type III domain. 
47. The method of claim 40 or 41, wherein said protein comprises the 
tenth module of said fibronectin type III domain (¹⁰Fn3). 

48. The method of claim 47, wherein said compound binding is 
mediated by one ¹⁰Fn3 loop. 

49. The method of claim 47, wherein said compound binding is 
mediated by two ¹⁰Fn3 loops. 

50. The method of claim 47, wherein said compound binding is 
mediated by three ¹⁰Fn3 loops. 

51. The method of claim 47, wherein the second loop of said ¹⁰Fn3 is 
extended in length relative to the naturally-occurring module. 

52. The method of claim 47, wherein said ¹⁰Fn3 lacks an integrin- 
binding motif. 

53. The method of claim 40 or 41, wherein said protein is covalently 
bound to a nucleic acid. 

54. The method of claim 53, wherein said nucleic acid encodes said 
protein. 

55. The method of claim 53, wherein said nucleic acid is RNA. 
56. The method of claim 40, wherein said compound is immobilized on 
a solid support. 

57. The method of claim 41, wherein said protein is immobilized on a 
solid support. 

58. The method of claim 56 or 57, wherein said solid support is a 
column or microchip. 

59. A method for detecting a compound in a sample, said method 
comprising: 

(a) contacting said sample with a protein which binds to said compound 
and which comprises a fibronectin type III domain having at least one randomized 
loop, said contacting being carried out under conditions that allow compound- 
protein complex formation; and 

(b) detecting said complex, thereby detecting said compound in said 
sample. 

60. The method of claim 59, wherein said protein is immobilized on a 
solid support. 

61. The method of claim 60, wherein said protein is immobilized on 
said solid support as part of an array. 
62. The method of claim 60, wherein said solid support is a chip or 
bead. 

63. The method of claim 59, wherein said protein is covalently bound to 
a nucleic acid. 

64. The method of claim 59, wherein said nucleic acid encodes said 
protein. 

65. The method of claim 64, wherein said nucleic acid is RNA. 

66. The method of claim 59, wherein said compound is a protein. 

67. The method of claim 59, wherein said compound is detected by 
radiography, fluorescence detection, mass spectroscopy, or surface plasmon 
resonance. 
PROTEIN SCAFFOLDS FOR ANTIBODY MIMICS 
AND OTHER BINDING PROTEINS 
Abstract of the Disclosure 
Disclosed herein are proteins that include a fibronectin type III domain 
having at least one randomized loop. Also disclosed herein are nucleic acids 
encoding such proteins and the use of such proteins in diagnostic methods and in 
methods for evolving novel compound-binding species and their ligands. 




FIG. 3 




Figure 8 (band labels: His-¹⁰Fn3, ¹⁰Fn3, mRNA-His-¹⁰Fn3, ¹⁰Fn3, mRNA-¹⁰Fn3) 



Figure 9 (gel panels: Round 1 (30 cycles), Round 2 (30 cycles), Round 3 (30 cycles), 
Round 4 (30 cycles), Round 5 (30 cycles), Round 6 (20 cycles); 50 bp ladder; 
band marked at 360 bp) 




Figure 10 




¹⁰Fn3-CH2-CH3 ¹⁰Fn3-CH1-CH2-CH3 



Figure 11 







Figure 12 






Array feature labels (two panels): CP5', CP3', CPneg1, CPflag, CPneg2, CPneg3 



PATENT 

ATTORNEY DOCKET NO: 50036/021003 
COMBINED DECLARATION AND POWER OF ATTORNEY 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and 
joint inventor (if plural names are listed below) of the subject matter which is claimed and for which a 
patent is sought on the invention entitled PROTEIN SCAFFOLDS FOR ANTIBODY MIMICS AND OTHER 
BINDING PROTEINS, the specification of which 

■ is attached hereto. 

□ was filed on as Application Serial No. 

and was amended on . 

□ was described and claimed in PCT International Application No. 

filed on and as amended under PCT Article 19 on . 



I hereby state that I have reviewed and understand the contents of the above-identified specification, 
including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose all information I know to be material to patentability in accordance with 
Title 37, Code of Federal Regulations, §1.56(a). 

FOREIGN PRIORITY RIGHTS: I hereby claim foreign priority benefits under Title 35, United States Code, 
§119 of any foreign application(s) for patent or inventor's certificate or of any PCT international 
application(s) designating at least one country other than the United States of America listed below and 
have also identified below any foreign application for patent or inventor's certificate or any PCT 
international application(s) designating at least one country other than the United States of America filed 
by me on the same subject matter having a filing date before that of the application(s) of which priority is 
claimed: 



Country    Serial Number    Filing Date    Priority Claimed? (Yes/No) 



PROVISIONAL PRIORITY RIGHTS: I hereby claim priority benefits under Title 35, United States Code, 
§119(e) and §120 of any United States provisional patent application(s) listed below filed by an inventor or 
inventors on the same subject matter as the present application and having a filing date before that of the 
application(s) of which priority is claimed: 

Serial Number    Filing Date          Status 
60/111,737       December 10, 1998    Abandoned 




NON-PROVISIONAL PRIORITY RIGHTS: I hereby claim the benefit under Title 35, United States Code, 
§120 of any United States application(s) listed below and, insofar as the subject matter of each of the 
claims of this application is not disclosed in the prior United States application in the manner provided by 
the first paragraph of Title 35, United States Code, §112, I acknowledge the duty to disclose all information 
I know to be material to patentability as defined in Title 37, Code of Federal Regulations, §1.56(a) which 
became available between the filing date of the prior application and the national or PCT international filing 
date of this application: 

Serial Number    Filing Date         Status 
09/456,693       December 9, 1999    Pending 



I hereby appoint the following attorneys and/or agents to prosecute this application and to transact all 
business in the Patent and Trademark Office connected therewith: Paul T. Clark, Reg. No. 30,162, Karen 
L. Elbing, Ph.D. Reg. No. 35,238, Kristina Bieker-Brady, Ph.D. Reg. No. 39,109, Susan M. Michaud, Ph.D. 
Reg. No. 42,885, Mary Rose Scozzafava, Ph.D., Reg. No. 36,268, James D. DeCamp, Ph.D., Reg. No. 
43,580. 

Address all telephone calls to: Karen L. Elbing, Ph.D. at 617/428-0200. 

Address all correspondence to: Karen L. Elbing, Ph.D. at Clark & Elbing LLP, 176 Federal Street, Boston, 
MA 02110. 

I hereby declare that all statements made herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements were made with 
the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, 
or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements 
may jeopardize the validity of the application or any patents issued thereon. 



Full Name (First, Middle, Last): Dasa Lipovsek 
Residence Address (City, State, Country): 45 Sunset Road, Cambridge, MA 02138 
Post Office Address (Street, City, State, Country): 45 Sunset Road, Cambridge, MA 02138 
Citizenship: Slovenia 
Signature: 
Date: 

Full Name (First, Middle, Last): Peter Lohse 
Residence Address (City, State, Country): 50 Golden Ball Road, Weston, MA 02493 
Post Office Address (Street, City, State, Country): 50 Golden Ball Road, Weston, MA 02493 
Citizenship: Switzerland 
Signature: 
Date: 






